The Necessity of Average Rewards in Cooperative Multirobot Learning

Authors

  • Poj Tangamchit
  • John M. Dolan
  • Pradeep K. Khosla
Abstract

Learning can be an effective way for robot systems to deal with dynamic environments and changing task conditions. However, popular single-robot learning algorithms based on discounted rewards, such as Q-learning, do not achieve cooperation (i.e., purposeful division of labor) when applied to task-level multirobot systems. A task-level system is defined as one performing a mission that is decomposed into subtasks shared among robots. In this paper, we demonstrate the superiority of average-reward-based learning, such as the Monte Carlo algorithm, for task-level multirobot systems, and suggest an explanation for this superiority.
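The contrast between discounted and average rewards can be made concrete with a small sketch. The code below is not the authors' algorithm or experiment. It is a minimal illustration, under assumed values, of how the two criteria can rank the same pair of reward streams differently. The stream names, reward values, and the discount factor `GAMMA` are all hypothetical.

```python
from typing import Sequence


def discounted_return(rewards: Sequence[float], gamma: float) -> float:
    """Finite-horizon discounted return: sum of gamma**t * r_t."""
    return sum((gamma ** t) * r for t, r in enumerate(rewards))


def average_reward(rewards: Sequence[float]) -> float:
    """Mean reward per time step over the sequence."""
    return sum(rewards) / len(rewards)


# Hypothetical reward streams over 12 time steps (illustrative values only).
solo_stream = [1.0, 0.0, 0.0, 0.0] * 3   # small reward that arrives immediately
shared_stream = [0.0, 0.0, 2.0] * 4      # larger reward that arrives after a delay

GAMMA = 0.5  # assumed discount factor, set low to make the effect visible

if __name__ == "__main__":
    for name, stream in [("solo", solo_stream), ("shared", shared_stream)]:
        print(name, discounted_return(stream, GAMMA), average_reward(stream))
```

With these assumed numbers, the discounted criterion ranks the immediate-reward stream higher, while the average-reward criterion ranks the delayed but higher-rate stream higher. Whether this mechanism matches the explanation offered in the paper cannot be determined from the abstract alone.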

Similar resources

Reward and Diversity in Multirobot Foraging

This research seeks to quantify the impact of the choice of reward function on behavioral diversity in learning robot teams. The methodology developed for this work has been applied to multirobot foraging, soccer, and cooperative movement. This paper focuses specifically on results in multirobot foraging. In these experiments, three types of reward are used with Q-learning to train a multirobot team...

Crucial factors affecting cooperative multirobot learning

Cooperative decentralized multirobot learning refers to the use of multiple learning entities to learn optimal solutions for an overall multirobot system. We demonstrate that traditional single-robot learning theory can be successfully used with multirobot systems, but only under certain conditions. The success and the effectiveness of single-robot learning algorithms in multirobot systems are ...

Crucial Factors in Cooperative Multirobot Learning

Cooperative decentralized multirobot learning refers to the use of multiple learning entities to learn optimal solutions for an overall multirobot system. We demonstrate that traditional single-robot learning theory can be successfully used with multirobot systems, but only under certain conditions. The success and the effectiveness of single-robot learning algorithms in multirobot systems are ...

Increase In Activity And Learning Outcomes In Pharmacy Mathematics With Jigsaw Cooperative Learning Model At Pharmacy Academy Of Dwi Farma

Introduction: In the Pharmacy Diploma Program, mathematics is known as pharmaceutical mathematics. Because pharmaceutical mathematics is important in practice, students need basic mathematical skills as a foundation for calculations in pharmaceutical science. Therefore, it is necessary to create lecture conditions that enable students to be more active in understanding the lessons. This rese...

The effect of constructivist-based approach of teaching in science Courses on cooperative learning of Secondary school students and its sustainability over time

Introduction: The results of international research evaluating academic achievement, which studies the process of teaching experimental sciences, have shown that Iran ranks below average. Therefore, special attention to the experimental sciences course is an essential and obvious need. In this regard, the purpose of this study was to investigate the effect of teaching...

Publication date: 2002